Abstract

A classical method for finding a point in the intersection of a finite collection of closed
convex sets is the successive projection method. It is well known that this method converges if
the convex sets are chosen for projection in a cyclic manner. In this note we show that the method
still converges if the length of the cycle grows without bound, provided that the growth is not too
fast. Our argument is based on an interesting application of the Cauchy-Schwarz inequality.

1. Introduction

A fundamental problem in convex programming is that of finding a point in the intersection
of a finite collection of closed convex sets. This problem has many application areas, including
image reconstruction [SSW77], [Her80], linear prediction theory, multigrid methods, computed
tomography [Deu85], optimal control [GPR67], machine learning [MiP88], and linear/quadratic
programming [Bre65]. One classical method for solving this problem is the successive projection
method, whereby an arbitrary starting point is successively projected onto the individual convex sets
to generate a sequence of points converging to a solution. This method was first proposed by
Kaczmarz [Kac37] for the special case where the sets are linear varieties (i.e., translates of
subspaces), and was rediscovered by von Neumann [voN50], by Agmon [Agm54], and by Motzkin
and Schoenberg [MoS54]. (Also see [Deu85], [Gof80], [Gof82], [Hal62], [Man84], [Mer62],
[SSW77], [Tan71] for more detailed treatments of the linear case, including rate of convergence
analysis [Agm54], [Deu85], [Gof80], [Gof82], [Man84], [Mer62], [SSW77] and finite convergence
analysis [MoS54], [Gof80], [Gof82].) Extensions of this method to arbitrary convex sets are
discussed in [Bre65], [Ere65], [Ere66], [GeS66], [GPR67], [Pol69], amongst which the analysis
given in [GPR67] is the most extensive. This method can also be applied to problems in a product
space to obtain a highly parallelizable method of barycentres [Pie84].

In all the existing successive projection methods, the sets are chosen for projection either in
an essentially cyclic order (i.e., every set is chosen at least once every $B$ iterations, for some
fixed $B > 0$) or according to a maximal distance rule (i.e., choose a set that is in some sense
farthest away from the current point). (Numerical evidence suggests that the essentially cyclic
order is perhaps the more efficient [Man84].) In this note we show that the essentially cyclic order
can be further extended to one that allows the length of each cycle, namely $B$, to increase without
bound, provided that the rate of increase is not too fast. This extension, apart from its theoretical
appeal, has the practical advantage that it allows sets for which projection is expensive to be left
out of the computation increasingly often.

2. Algorithm Description

Let $H$ be a real Hilbert space endowed with an inner product $\langle\cdot,\cdot\rangle$ and let
$\|\cdot\|$ denote the norm induced by the inner product (i.e., $\|x\| = \langle x,x\rangle^{1/2}$).
Let $C_1, C_2, \ldots, C_m$ be a given collection of closed convex sets in $H$. Our problem is to
find a point in $C = C_1 \cap C_2 \cap \cdots \cap C_m$. We make the following standing assumption
regarding the $C_i$'s.

Assumption A (feasibility): $C_1 \cap C_2 \cap \cdots \cap C_m \neq \emptyset$.

Assumption A is fairly standard. Some results for the case where the $C_i$'s do not intersect are
given in [GPR67].

We describe the successive projection method below. In this method, we begin with an
arbitrary $x(0)$ in $H$ and we generate a sequence of points $\{x(0), x(1), \ldots\}$ in $H$
according to the iteration:

$$y(t) = \arg\min\{\, \|y - x(t)\| \mid y \in C_{\sigma(t)} \,\}, \tag{1a}$$

$$x(t+1) = \omega(t)\, y(t) + (1 - \omega(t))\, x(t), \tag{1b}$$

where $\sigma(t)$ is an element of $\{1, 2, \ldots, m\}$ ($\sigma(t)$ specifies the set onto which
projection is made at the $t$-th iteration), and $\omega(t)$ is a scalar (called a relaxation
parameter) satisfying

$$\varepsilon \le \omega(t) \le 2 - \varepsilon, \quad \forall\, t, \tag{1c}$$

for some fixed $\varepsilon \in (0,1]$. The relaxation mechanism $\omega(t)$ was first introduced in
[Agm54] and in [MoS54]. It has been observed that, in certain cases, a value of $\omega(t)$
different from one (i.e., under/over-relaxation) can significantly improve the convergence [Gof80],
[Her80], [Man84].
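
To make iteration (1a)-(1c) concrete, the following is a minimal sketch in $\mathbb{R}^n$ rather than a general Hilbert space. The function names (`project_halfspace`, `relaxed_step`, `successive_projection`), the choice of halfspaces as the sets, the three example sets, and the 0-based set indices are all illustrative assumptions and do not come from the note.

```python
def project_halfspace(x, a, b):
    """Orthogonal projection of x onto the halfspace {y : <a, y> <= b}."""
    violation = sum(ai * xi for ai, xi in zip(a, x)) - b
    if violation <= 0.0:
        return list(x)  # x already lies in the set
    scale = violation / sum(ai * ai for ai in a)
    return [xi - scale * ai for ai, xi in zip(a, x)]


def relaxed_step(x, project, omega):
    """One pass of (1a)-(1b): y = P(x), then x_next = omega*y + (1 - omega)*x."""
    y = project(x)
    return [omega * yi + (1.0 - omega) * xi for yi, xi in zip(y, x)]


def successive_projection(x0, projections, sigma, omega, num_iters):
    """Run num_iters iterations; sigma(t) is a 0-based index into projections."""
    x = list(x0)
    for t in range(num_iters):
        x = relaxed_step(x, projections[sigma(t)], omega(t))
    return x


# Hypothetical sets in R^2: x1 <= 1, x2 <= 1, and x1 + x2 >= 1.
halfspaces = [([1.0, 0.0], 1.0), ([0.0, 1.0], 1.0), ([-1.0, -1.0], -1.0)]
projections = [lambda x, a=a, b=b: project_halfspace(x, a, b) for a, b in halfspaces]
m = len(projections)

x_final = successive_projection(
    x0=[5.0, -3.0],
    projections=projections,
    sigma=lambda t: t % m,   # plain cyclic order
    omega=lambda t: 1.5,     # over-relaxation; satisfies (1c) with eps = 0.5
    num_iters=200,
)

# Largest constraint violation at the final iterate (<= 0 means feasible).
violation = max(sum(ai * xi for ai, xi in zip(a, x_final)) - b for a, b in halfspaces)
```

Passing `sigma` and `omega` as functions of `t` keeps the order of projection and the relaxation schedule separate from the update itself, so other orders can be tried without touching `relaxed_step`.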

The iterates $\{x(t)\}$ in general do not converge unless we impose certain restrictions on the
order in which projections onto the sets $C_1, C_2, \ldots, C_m$ are made. We will consider the
following order of projection, introduced in [TsB87]:

Quasi-Cyclic Order: There exists a sequence of integers $\{\tau_1, \tau_2, \ldots\}$ satisfying

$$\tau_1 = 0, \qquad \tau_{k+1} - \tau_k \ge m \ \ \forall\, k, \qquad \sum_{k=1}^{\infty} \frac{1}{\tau_{k+1} - \tau_k} = \infty, \tag{2a}$$

for which

$$\{1, 2, \ldots, m\} \subseteq \{\sigma(\tau_k),\ \sigma(\tau_k + 1),\ \ldots,\ \sigma(\tau_{k+1} - 1)\}, \quad \forall\, k. \tag{2b}$$

Roughly speaking, the quasi-cyclic order (of projection) states that every set $C_i$ must be
chosen for projection at least once between the $\tau_k$-th and the $(\tau_{k+1} - 1)$-st iteration
(called the $k$-th quasi-cycle) for all $k$ [cf. (2b)], and that the length of the $k$-th
quasi-cycle, namely $\tau_{k+1} - \tau_k$, cannot grow too fast with $k$ [cf. (2a)]. One particular
choice of the $\tau_k$'s, namely $\tau_k = m(k-1)$ for all $k$, gives rise to the well-known cyclic
order (of projection), for which $\sigma(t) = t \ (\mathrm{mod}\ m) + 1$ for all $t$ (and the length
of each quasi-cycle is exactly $m$). A more interesting choice of the $\tau_k$'s is given by

$$\tau_{k+1} = \tau_k + k\,m, \quad \forall\, k,$$

for which the length of the $k$-th quasi-cycle increases linearly with $k$.
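
The linearly growing choice above can be sketched in code. The snippet below builds the quasi-cycle boundaries with lengths $k\,m$, constructs one admissible order of projection, and checks condition (2b). The helper names and the particular order (set $m$ is treated as the hypothetical "expensive" set and visited once per quasi-cycle, while sets $1, \ldots, m-1$ are swept repeatedly) are assumptions made for illustration, and the sketch assumes $m \ge 2$.

```python
def quasi_cycle_starts(m, num_cycles):
    """Return [tau_1, ..., tau_{num_cycles+1}] with tau_1 = 0, tau_{k+1} = tau_k + k*m."""
    taus = [0]
    for k in range(1, num_cycles + 1):
        taus.append(taus[-1] + k * m)
    return taus


def build_sigma(m, taus):
    """One admissible order (assumes m >= 2): in each quasi-cycle, set m is
    projected once, at the first iteration, and the remaining iterations sweep
    sets 1..m-1 in cyclic fashion."""
    sigma = []
    for k in range(len(taus) - 1):
        length = taus[k + 1] - taus[k]
        sigma.append(m)
        sigma.extend(1 + (j % (m - 1)) for j in range(length - 1))
    return sigma


def satisfies_2b(m, taus, sigma):
    """Check that every set index 1..m appears in every quasi-cycle."""
    needed = set(range(1, m + 1))
    return all(
        needed <= set(sigma[taus[k]:taus[k + 1]]) for k in range(len(taus) - 1)
    )


m = 3
taus = quasi_cycle_starts(m, num_cycles=50)
sigma = build_sigma(m, taus)

# Partial sum of the series in (2a) over the first 50 quasi-cycles.
partial_sum = sum(1.0 / (taus[k + 1] - taus[k]) for k in range(len(taus) - 1))
```

With lengths $k\,m$ the partial sums of the series in (2a) are $1/m$ times the harmonic numbers, so they grow without bound (roughly like $(\ln K)/m$ after $K$ quasi-cycles), which is why this choice satisfies (2a). By contrast, lengths $k^2 m$ would give a convergent series and so would violate (2a).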

3. Convergence Analysis

Below we give the main result of this note.

Theorem 1. Let $\{x(t)\}$ be a sequence of iterates generated by (1a)-(1c) using the quasi-cyclic
order of projection. Then, $\{x(t)\}$ converges in the weak topology to a point $x^\infty$ in $C$.

Proof: Fix any $\bar{x} \in C$. For any nonnegative integer $t$, we have from (1a) that $y(t)$ is
the orthogonal projection of $x(t)$ onto $C_{\sigma(t)}$. Since $\bar{x} \in C$ so that
$\bar{x} \in C_{\sigma(t)}$ (cf. definition of $C$), this implies

$$\langle y(t) - x(t),\ \bar{x} - y(t) \rangle \ge 0.$$

Since [cf. (1b)]

$$\|\bar{x} - x(t)\|^2 = \|\bar{x} - x(t+1)\|^2 + 2\omega(t)\, \langle y(t) - x(t),\ \bar{x} - y(t) \rangle + \omega(t)\,(2 - \omega(t))\, \|y(t) - x(t)\|^2,$$

this, together with (1c), implies

$$\|\bar{x} - x(t)\|^2 \ge \|\bar{x} - x(t+1)\|^2 + \varepsilon^2 \|x(t+1) - x(t)\|^2, \quad \forall\, t \ge 0, \tag{3}$$

so that $\{x(t)\}$ is bounded, $\|\bar{x} - x(t)\|$ is nonincreasing with $t$, and

$$\sum_{t=0}^{\infty} \|x(t+1) - x(t)\|^2 < \infty. \tag{4}$$
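
For readers who want the intermediate steps, the passage from the identity above to (3) and then to (4) can be spelled out as follows. This is a sketch that reads the constant in (3) as $\varepsilon^2$; the projection inequality is used to drop the inner-product term, and the horizon $T$ is an auxiliary symbol introduced here.

```latex
\begin{align*}
\omega(t)\,(2-\omega(t))\,\|y(t)-x(t)\|^2
  &= \frac{2-\omega(t)}{\omega(t)}\,\|x(t+1)-x(t)\|^2
     && \text{since } x(t+1)-x(t)=\omega(t)\,(y(t)-x(t)) \\
  &\ge \frac{\varepsilon}{2-\varepsilon}\,\|x(t+1)-x(t)\|^2
     && \text{by (1c)} \\
  &\ge \varepsilon^2\,\|x(t+1)-x(t)\|^2
     && \text{since } \varepsilon\,(2-\varepsilon)\le 1, \\[4pt]
\varepsilon^2 \sum_{t=0}^{T}\|x(t+1)-x(t)\|^2
  &\le \|\bar{x}-x(0)\|^2-\|\bar{x}-x(T+1)\|^2
   \le \|\bar{x}-x(0)\|^2
     && \text{for every } T \ge 0.
\end{align*}
```

The last line is obtained by summing (3) over $t = 0, \ldots, T$ and telescoping; letting $T \to \infty$ gives (4).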

We claim that there exists a subsequence $\mathcal{K}$ of $\{1, 2, \ldots\}$ for which

$$\sum_{t=\tau_k+1}^{\tau_{k+1}} \|x(t+1) - x(t)\| \to 0 \quad \text{as } k \to \infty,\ k \in \mathcal{K}. \tag{5}$$

To see this, suppose that such a subsequence does not exist. Then, there would exist a positive
scalar $\delta$ and an integer $\bar{k}$ such that

$$\delta \le \sum_{t=\tau_k+1}^{\tau_{k+1}} \|x(t+1) - x(t)\|, \quad \forall\, k \ge \bar{k}.$$

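Since, by the Cauchy-Schwarz inequality, there holds

$$\sum_{t=\tau_k+1}^{\tau_{k+1}} \|x(t+1) - x(t)\| \le \sqrt{\sum_{t=\tau_k+1}^{\tau_{k+1}} \|x(t+1) - x(t)\|^2}\ \cdot \sqrt{\tau_{k+1} - \tau_k},$$

this implies

$$\delta^2 \le \left( \sum_{t=\tau_k+1}^{\tau_{k+1}} \|x(t+1) - x(t)\|^2 \right) (\tau_{k+1} - \tau_k), \quad \forall\, k \ge \bar{k},$$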


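so that

$$\delta^2 \sum_{k=\bar{k}}^{\infty} \frac{1}{\tau_{k+1} - \tau_k} \le \sum_{k=\bar{k}}^{\infty} \left[ \sum_{t=\tau_k+1}^{\tau_{k+1}} \|x(t+1) - x(t)\|^2 \right] = \sum_{t=\tau_{\bar{k}}+1}^{\infty} \|x(t+1) - x(t)\|^2. \tag{6}$$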


The left-hand side of (6), by (2a), has the extended value of $\infty$, while the right-hand side
of (6), according to (4), has a finite value, a contradiction. Hence, (5) holds for some
$\mathcal{K} \subseteq \{1, 2, \ldots\}$.

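Let $\mathcal{K}$ be any subsequence of $\{1, 2, \ldots\}$ satisfying (5). Since $\{x(t)\}$ is
bounded [cf. (3)], there exist some $x^\infty \in H$ and some $\mathcal{K}' \subseteq \mathcal{K}$
such that

$$\{x(\tau_k + 1)\}_{k \in \mathcal{K}'} \ \text{converges weakly to } x^\infty. \tag{7}$$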

We claim that $x^\infty \in C$. To see this, fix any $i \in \{1, 2, \ldots, m\}$. Since the
projections are in the quasi-cyclic order, for each integer $k \ge 1$ there exists some
$p_k \in \{\tau_k, \tau_k + 1, \ldots, \tau_{k+1} - 1\}$ satisfying $\sigma(p_k) = i$ [cf. (2b)].
By using the triangle inequality together with the fact
$\|x(p_k + 1) - y(p_k)\| \le (1/\varepsilon - 1)\, \|x(p_k + 1) - x(p_k)\|$ for all $k \ge 1$
[cf. (1b), (1c)], we have

$$\begin{aligned}
\|x(\tau_k + 1) - y(p_k)\| &\le \sum_{t=\tau_k+1}^{\tau_{k+1}} \|x(t+1) - x(t)\| + \|x(p_k + 1) - y(p_k)\| \\
&\le \sum_{t=\tau_k+1}^{\tau_{k+1}} \|x(t+1) - x(t)\| + (1/\varepsilon - 1)\, \|x(p_k + 1) - x(p_k)\|, \quad \forall\, k \ge 1.
\end{aligned} \tag{8}$$
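
The bound on $\|x(p_k+1) - y(p_k)\|$ used in (8) is stated without derivation. One way to obtain it from (1b) and (1c), written for a generic iteration index $t$, is sketched below.

```latex
\begin{align*}
x(t+1) - x(t) &= \omega(t)\,\bigl(y(t) - x(t)\bigr), \qquad
x(t+1) - y(t) = \bigl(1 - \omega(t)\bigr)\,\bigl(x(t) - y(t)\bigr), \\[4pt]
\|x(t+1) - y(t)\| &= \frac{|1 - \omega(t)|}{\omega(t)}\,\|x(t+1) - x(t)\|
  \le \Bigl(\frac{1}{\varepsilon} - 1\Bigr)\,\|x(t+1) - x(t)\|, \\[4pt]
\text{because}\quad
\frac{1-\omega}{\omega} &= \frac{1}{\omega} - 1 \le \frac{1}{\varepsilon} - 1
  \quad \text{for } \omega \in [\varepsilon, 1], \\
\frac{\omega-1}{\omega} &= 1 - \frac{1}{\omega} \le \frac{1-\varepsilon}{2-\varepsilon}
  \le \frac{1-\varepsilon}{\varepsilon} = \frac{1}{\varepsilon} - 1
  \quad \text{for } \omega \in [1, 2-\varepsilon].
\end{align*}
```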

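Eqs. (5), (8) together with $\{\|x(t+1) - x(t)\|\} \to 0$ [cf. (4)] imply

$$\lim_{k \to \infty,\ k \in \mathcal{K}'} \|x(\tau_k + 1) - y(p_k)\| = 0,$$

which, combined with (7), yields

$$\lim_{k \to \infty,\ k \in \mathcal{K}'} \langle u, y(p_k) \rangle = \lim_{k \to \infty,\ k \in \mathcal{K}'} \langle u, x(\tau_k + 1) \rangle = \langle u, x^\infty \rangle, \quad \forall\, u \in H,$$

so that $\{y(p_k)\}_{k \in \mathcal{K}'}$ converges weakly to $x^\infty$. Since $y(p_k) \in C_i$
[cf. $\sigma(p_k) = i$ and (1a)] for all $k$ and $C_i$ is closed and convex, this shows
$x^\infty \in C_i$ [DuS66, p. 422, Theorem 3.13]. Since the choice of $i$ was arbitrary, we obtain
$x^\infty \in C_i$ for all $i$, and therefore $x^\infty \in C$.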

We now show that $\{x(t)\}$ has a unique weak limit point. Our argument follows that given in
[Bre65] and is presented here for completeness. Suppose that $\{x(t)\}$ does not have a unique weak
limit point. Then, there exist $x_1^\infty \in C$, $x_2^\infty \in C$ with
$x_1^\infty \neq x_2^\infty$ and subsequences $\{x(t)\}_{t \in T_1}$, $\{x(t)\}_{t \in T_2}$
converging weakly to, respectively, $x_1^\infty$ and $x_2^\infty$. By replacing $\bar{x}$ in (3) by
$x_1^\infty$, we find that $\|x_1^\infty - x(t)\|$ is nonincreasing with $t$, so that there exists a
scalar $\alpha_1$ such that

$$\{\|x_1^\infty - x(t)\|^2\} \to \alpha_1. \tag{9a}$$

Similarly, by replacing $\bar{x}$ in (3) by $x_2^\infty$, we find that there exists a scalar
$\alpha_2$ such that

$$\{\|x_2^\infty - x(t)\|^2\} \to \alpha_2. \tag{9b}$$

Now, for any $t \in T_1$ we have

$$\|x_2^\infty - x(t)\|^2 = \|x_2^\infty - x_1^\infty\|^2 + 2\,\langle x_2^\infty - x_1^\infty,\ x_1^\infty - x(t) \rangle + \|x_1^\infty - x(t)\|^2,$$

so that, by letting $t \to \infty$, $t \in T_1$, we obtain from (9a)-(9b) and the weak convergence of
$\{x(t)\}_{t \in T_1}$ to $x_1^\infty$ that $\alpha_2 = \|x_2^\infty - x_1^\infty\|^2 + \alpha_1$.
By an analogous argument with the roles of $x_1^\infty$ and $x_2^\infty$ reversed, we also obtain
$\alpha_1 = \|x_1^\infty - x_2^\infty\|^2 + \alpha_2$. Adding these two relations yields
$0 = 2\,\|x_1^\infty - x_2^\infty\|^2$ and hence $x_1^\infty = x_2^\infty$, a contradiction. Q.E.D.


We remark that strong convergence of the sequence $\{x(t)\}$ generated by (1a)-(1c), using the
quasi-cyclic order of projection, can also be established under the same set of conditions on the
$C_i$'s as those given by Gubin et al. [GPR67, Theorem 1]. The proof of this follows (in a
straightforward manner) from combining the proof of Theorem 1 with that of [GPR67, Theorem 1],